Topic and language specific internet search engine
نویسندگان
چکیده
In this paper we present the result of our project that aims to build a categorization-based topic-oriented Internet search engine. Particularly, we focus on the economic related electronic materials available on the Internet in Hungarian. We present our search service that harvests, stores and makes searchable the publicly available contents of the subject domain. The paper describes the search facilities and the structure of the implemented system with special emphasis on intelligent search algorithms and document processing methods.
منابع مشابه
Categorization-based Topic-oriented Internet Search Engine
In this paper we present the result of our project that aims at build up a categorization-based topic-oriented Internet search engine. Particularly, we focus on the economic related electronic materials available in Hungarian on the Internet. We present D. Tikk et al. Categorization-based Topic-oriented Internet Search Engine 234 our search service that harvests, stores and makes searchable the...
متن کاملBuilding Topic Specific Language Mo Competitive Mo
The ability to build topic specific language models, rapidly and with minimal human effort, is a critical need for fast deployment and portability of ASR across different domains. The World Wide Web (WWW) promises to be an excellent textual data resource for creating topic specific language models. In this paper we describe an iterative web crawling approach which uses a competitive set of adap...
متن کاملThe Core of a Topic-Specific Search Engine: How to Create It
A technique for gathering scientific, narrow topic-related documents from the Internet is presented. It has been successfully applied to compile a large Japanese collection of algorithms and their applications. Key-Words: Search Engine, Similarity Metrics, Crawler
متن کاملSubwebs for specialized search
We describe a method to define and use subwebs, user-defined neighborhoods of the Internet. Subwebs help improve search performance by inducing a topic-specific page relevance bias over a collection of documents. Subwebs may be automatically identified using a simple algorithm we describe, and used to provide highly-relevant topic-specific information retrieval. Using subwebs in a Help and Supp...
متن کاملThe Mechanics of a Deep Net Metasearch Engine
The Deep Net refers to the thousands of topic-specific search engines on the Internet, including those that are inaccessible to traditional crawler-based search engines. Commercial metasearch engines have been slow to provide a simple, universal interface to these smaller topic-specific search engines. Turbo10 has developed a commercial metasearch engine that connects to these resources en mass...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Acta Cybern.
دوره 18 شماره
صفحات -
تاریخ انتشار 2007